Temporal characteristics of emphasis in continuous speech
Abstract
The present study examines how global tempo adjustment can reflect the allocation of emphasis, whether emphasis is a local prosodic phenomenon, whether the degree of perceived emphasis corresponds systematically to the speech signal, and whether temporal features can be derived from production analysis. Acoustic analysis showed positive correlations between perceived emphasis and both local and global tempo modulations: a higher degree of emphasis corresponds to overall tempo slowing, while the duration adjustment of individual phones is independent of segmental make-up. To demonstrate how utterance-level global tempo modulations driven by discourse information may affect word-level local tempo adjustment, we normalized all possible effects of discourse factors and found sharper contrasts between emphasis and non-emphasis. The present results suggest that (1) emphasis should not be treated as a local prosodic phenomenon in continuous speech and (2) emphasis can be better understood by degree of contrast.
Related articles
Effects of ageing on speed and temporal resolution of speech stimuli in older adults
Background: According to previous studies, most speech recognition disorders in older adults result from deficits in audibility and auditory temporal resolution. In this paper, the effect of ageing on time-compressed speech and auditory temporal resolution, measured by word recognition in continuous and interrupted noise, was studied. Methods: A time-compressed speech test (TCST) w...
Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in the spectro-temporal feature space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change over time. The clusters were temporally tracked and temporal tra...
Speech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
Word segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
New temporal features for robust speech recognition with emphasis on microphone variations
Although the delta and RASTA methods have been widely used in extracting the temporal properties of stationary features for robust speech recognition, there is still a need to investigate new temporal features for better performance. In this paper, we present two new temporal features for robust processing of speech signals with emphasis on microphone variations. First, the temporal feature is ...